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ON THE DEVELOPMENT OF [p + (1 —p)T- 



BY E. L. DE FOEEST, M. A., WATEETOWN, CONN. 

It is well known that when p and q denote respectively the probabilities 
that an event will happen and that it will fail to happen, in a single trial, 
so that p -f- q = 1, then if m trials are to be made independently of each 
other, the m+1 terms of the development 

(p+qr =p m +^p m -lq+^=V p <»-*q*+ .... +q m , (1) 

are the several probabilities that the event will happen any number of times 

from m down to 0, and that it will fail to happen a corresponding number 

of times from up to m. (See for instance Merriman's Least Squares, p. 123.) 

The terms of the series increase in value from either extremity toward an 

intermediate point where they attain a maximum. If we regard them as 

equidistant ordinates to a curve, then as m is increased, the form of this 

curve approximates rapidly to the probability curve 

— hV 
y — ce , 

and this is found to be the limiting form, when m becomes infinite. I pro- 
pose here to demonstrate it in a manner rather more simple than any I have 
met with, and also to consider some points not usually noticed in connection 
with it. 

Let the terms of the development be denoted by y ,yi, y 2 Ac, so that 

(p+q) m = Vo +Vi +y> + • • • • +y- 
Comparing this with (1) we see that 

y-i— .Vo — 9 m _i y-i—y\ _ q(m—i) _ 1 y z —y i _ g{m—2) 1 

Vo P ' yi 2p ' y 2 3p ' 

and in general Vn-i—Vt — gfo-*)_ lt ( 2 ) 



—66— 

When y, is denoted simply by y, and the constant interval between two 
consecutive ordinates is dx, and the origin is taken at the first term, then 

and (2) reduces to 

Ay _ (qm—p)dx—x , g s 

y p{x-\-dx) 

The largest term in the series (1) is that term whose rank is i = qm, as 
can be ascertained by trial in any particular case where qm is a whole num- 
ber, and also by considering that the term in question is then 

/"(m+1) n*""/,?™ 

r{pm+l)f\qm+iy * ' 

being the probability that the event will happen pm times and fail qm times, 
while the terms next preceding and following are 

r{pm+2)r{qm) K y ' r{pm)r(qm+2f H * 

Dividing each of these two by the former, we find the quotients 

P™ , and 9 1 " , 
pm-\- 1 qm -J- 1 

both of which are less than unity. We can always make qm a whole num- 
ber by taking m sufficiently large. 

In (3) let the origin be transferred from the first term in the series to the 
greatest term, by putting x-\-qmdx in place of x; then 

Ay _ — (x+pdx) ^ 

y p{qmdx+x-\-dx)' 
If m becomes infinite, and the successive ordinates are set close together 
so as to be consecutive, then at the limit dx and Ay become dx and dy, and 
dx and pdx vanish in comparison with x, so that (4) reduces to 

^ = =*_ (5) 

y p(qmdx + x) 

To find the equation of the curve from this by integration, we must ob- 
serve that the rules of the integral calculus presuppose that a differential is 
infinitely smaller than the function from which it is derived. If the nu- 
merator dy in the first member is to be infinitely smaller than the denomi- 
nator y, it will follow that in the second member, the numerator x must 
be infinitely less than qmdx-\-x, and consequently the x in the denominator 
is infinitely less than qmdx, and may be neglected, so that we may write 



dy —x 

y pqmdx' 



(6) 



r, (7) 
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But since x is finite, it cannot be infinitely less than pqmdx unless mdx 
is infinite, and this cannot be unless m is an infinity of the second order, 
that is, of a magnitude comparable to l-i-(dx) 2 . To make it such, we must 
suppose that the series extends throughout the whole infinite length of the 
axis of X. For dx is the interval between any two consecutive terms or or- 
dinates, and it is contained co times in a finite distance, say the unit of x, 
and this unit is contained cso times in the whole axis of X, so that dx is con- 
tained 00 s times in the whole axis, and thus the whole number of terms in 
the series becomes m+1 = oo 2 -fl Le (6) then be written 

dy — xdx 

y l pqm{dx) 

The denominator of the second member does not vary with x, and m(dx) 2 
is a constant and finite area. We will express the denominator by means 
of a new constant h, so that 

h = l+[dxi/(2pqm)], (8) 

and (7) becomes 

%L=—2Wxdx, 

y 

—h 2 x 2 

and integration gives log'y = — h 2 a?+log'c; . • . y = oe (9) 

Since p+q = 1, we have (p-\-q) m = 1, and the sum of all the terms in 
the series (1) is unity, wherefore the sum of all the consecutive values of y 
in the curve (9) is also unity, if that curve fully represents the limit of the 
series. Multiplying the sum of the ordinates, and unity, each by dx, we 
have the condition 

/at nax — %t x t 

ydx = dx; . * . c I e dx = dx. (10) 

The known value of the definite integral here is j/jt-s-A. Thus (10) de- 
termines the constant 

= hdx-7-i/x, (11) 

and substituting it in (9), we get 

hdx — A 2 * 2 , 1<n 

y = ^e , (12) 

the final equation of the limiting curve. We infer from (8) that h is the 
reciprocal of a finite line, so that JPx 2 is an abstract number, and finite as 
long as a; is finite. It follows that y is an abstract number, as a probability 
should be, and is an infinitesimal of the first order. 

Let i now be reckoned as x is, from the place of the maximum probabil- 
ity. As before, x = iJx and at the limit x = idx. Substituting in (12) 
this value of x and also the value of h from (8), we find 
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— i — i* 
y — (2npqm) e (13) 

When an infinite number m of trials are made, the most probable result, 
as we have seen, is that the event will happen pm times and fail qm times, 
and (13) gives the probability that it will happen exactly pm-\-i times and 
fail qm — i times. This formula holds good approximately also when m is 
a large finite number, provided that i is considerably smaller than pm or 
qm, a proviso which results from the fact that m in the formula is an infin- 
ity of the second order, while i is an infinity of the first erder, since i = x 
-s-dx, and x is finite. 

Since y is the probability that the number of times the event happens 
will deviate from the most probable number pm by a certain number i, we 
may regard x = idx as a deviation of position, or error of position, and re- 
gard y in general as the probability that any observed quantity, subject to 
accidental causes of error, will deviate from its most probable value by a 
certain quantity or error x. Equation (12) then is the equation of the prob- 
ability curve, expressing the law of facility of error. 

Let both its members be divided by dx, which is equivalent to multiply- 
ing them by a constant and infinitely large quantity. Denoting y-i-dx by 
Y. we have 

Y := —e , (14) 

and since y is an infinitesimal of the first order, Y is finite. This is what 
is commonly known as the equation of the probability curve, but it is evi- 
dent that here the probability of the error x is not the ordinate F, but the 
elementary area Ydx — y. The whole area of the curve is 



r. 



Ydx = 1. 



The probability that any error which occurs, when taken without regard 
to sign, will not exceed a certain limit x, is the sum of the probabilities of 
all the errors from — x up to -\-x, and equals the area of the curve between 
those limits, or 

P=Af e~ hHi dx. (15) 

If we put hx = t, then dx = dt-t-h, the limits x=±x become t = ±hx, 
and Y being a function of x 2 the curve is symmetrical on each side of the 

origin, so that 

° Of t 2 

P=JL-\e dt. (16) 

V* J o 
All works on probability contain the values of this integral arranged in 
a table, enabling us to find P for any given value of the argument t = hx. 
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It follows from (8) also that 

t = t-s-/(2pgm), (17) 

so that such a table will give the sum of 2i values of y in (13), being the 
approximate probability that when a large finite number m of trials are 
made, and i is an integer or half-integer much smaller than pm or qm, the 
number of times the event will happen will be one of the group of 2i con- 
secutive integers the middle of which falls nearest to the number pm. 

We have hitherto supposed the constant h to be determined from (8) in 
terms of m, but in applying the theory of probability of errors it generally 
happens that there are given only a certain finite number of observed errors 
or deviations from the true value of a quantity or from some value which is 
regarded as the most probable one, while the whole number m of different 
possible errors is infinite or at least unknown. To find h from the observed 
errors, the best course generally is to compute the "mean error", or as it has 
been more properly called, the quadratic error, which is defined to be the 
square root of the mean of the squares of all the observed errors. In the 
most typical case, when the number of observed errors is very large or in- 
finite, the square of the quadratic error is the quotient arising from dividing 
the sum of the squares of all the errors by the whole number of errors, 
that is 

s 2 r= f" x*Ydx-±- f" Ydx, (18) 

each error x being supposed to occur a number of times proportional to its 
probability Ydx. The value of the divisor here is unity, as we have seen. 
The dividend is 

This last definite integral is known to have the value Jv /jr - The quad- 
ratic error then is 

£ = ls-h l /2; . • . h = l-^ £l /2. (19) 

Since e 2 is the mean of all the observed values of x 2 , and x = idx, it fol- 
lows that (e -¥■ dx) 2 is the mean of all the observed values of t 2 . Substitu- 
ting in (19) the value of h from (8), we get 

e = dx ^/(pqm); . ' . # s-~dx = -y/(pqm). (20) 

This is the square root of the mean of the values of i 2 in (13), bearing the 
same relation to deviations i from the most probable number pm, as e bears 
to deviations x from the most probable value of any observed quantity. It 
is of use in statistical investigations. (Smithsonian Report of 1873, p. 334.) 

The constant h has been called the "measure of precision", because it va- 
ries inversely as the quadratic error ; but in point of fact the accuracy of a 
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set of observations is usually estimated by the probable error, denned to be 
that error r, which occupies an intermediate position among all the observed 
errors taken without regard to sign, so that for an infinite number of errors 
the number greater than r is equal to the number less than r, and the prob- 
ability that any error which occurs will be, for example, less than r, is §. 
By interpolation in a table of values of P, the value of t = hx correspond- 
ing to P = £ is found to be 

hr = 0.4769, 

and substituting the value of k from (19) we get 

r = .6745e, (21) 

by which the probable error is computed from the quadratic error. We al- 
so have by (20) 

r-hdx = .6745i/(pqm), (22) 

which is the probable or intermediate value of i, so that it is an even chance 
that the event will happen a number of times not deviating from the most 
probable number pm by more than this probable value, when a large num- 
ber m of trials are made. 

It should be noticed, however, that the series is in general symmetrical 
on both sides of the maximum probability or axis of Y, only when m is 
infinite. If m is finite, the terms of the series are not identical on opposite 
sides of that axis, and the probable error or deviation is somewhat greater 
on one side, and less on the other, than the value which (22) gives. It is 
greatest on that side which is toward the more remote end of the series. 
But in the special case where p = q = J, the maximum term is in the mid- 
dle of the series, and the probable errors are alike on both sides, whether m 
be infinite or finite. One of the usual elementary proofs of the law of prob- 
ability demonstrates it as the limit of the development of (J-f £) Jn . (See for 
instance Price's Calculus, Vol. II., p. 376.) 

Let us now return and consider the validity of our treatment of formula 
(5), from which (6) was obtained by neglecting the finite x in comparison 
with the infinite qmdx, where mdx represents the whole length of the axis 
of X, or the whole distance occupied by the series (1) when m is an infinity 
of the second order. It is evident, in the first place, that dy is not infinitely 
smaller than y in every part of the series. On the contrary, it is sometimes 
of a magnitude equal or comparable to that of y, and sometimes even infi- 
nitely greater than y, as appears from (5). 

When we put successively x = — qmdx -f a, x = pmdx — o, where a is 
finite and represents distance from either end of the series, (5) becomes 

dy qmdx — a ■, dy _ / pmdx — a \ 

y "~~ ap y \pmdx — apl' 
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As mdx is infinite, dy-i-y is infinite in the one case and —1 in the other, 
showing that, of two consecutive ordinates at any finite distance from either 
end of the series, that one which is nearest the end is infinitely smaller than 
the other one. Therefore, in going from either end toward the middle, the 
ratio of dy to y passes from infinity at any finite distance from the end, 
through intermediate and finite values, down to infinitesimal values for any 
finite distance from the point where y is a maximum ; and it is only to this 
latter region that formula (6) applies. Have we a right, then, to say that 
the terms outside of this region are so small that their sum is infinitesimal, 
and that the sum of the terms within it is therefore equal to unity? For 
we assumed this, when we determined the constant c in (9) by the condition 
(10). It would hardly seem to be self-evident, but there are facts which 
confirm its truth, and one of them is reached by computing the quadratic 
error for a finite series, as follows. 

In (1) let p, q and m have any consistent values, for instance 

P = h 1 = h rn = 4, 
so that ( p +q) m = ^(1+8 + 24+32+16). 

The second member is a series of five fractions which are the probabili- 
ties of the five different possible results, and may be represented by equi- 
distant ordinates, the constant interval between them being Ax. Let the 
origin be taken at the point whose distance from the first ordinate is qmAx 
= f Ax. This point falls two thirds of the way from the term 24 to the 
next term 32. The negative errors or deviations then are 

— %Ax, —%Ax, —%Ax, 
and the positive ones are \dx y \Ax. 

The square of the quadratic error is the mean of the squares of all the 
errors, each one being supposed to occur a number of times proportional to 
y, so that 

,2 = ■Ml(f) 2 +8(f) 2 +24(3)*+32q)*+16(4)']W 
^(1+8+24+32+16) " ' 

from which we find 

e* = |( Axf = pqm{ Axf. 

In like manner, for all other consistent values of p, q and m which have 
been tried, it has been found that the quadratic error is invariably Axi/(pqm). 
Hence we infer that at the limit, when m is an infinity of the second order, 
and Ax is reduced to dx 

e = dx V(pqm). 

But this is the value which has already been found in (20) by integrating 
from the probability curve. In other words, the summation of a?y for the 
whole of the infinite series gives the same result as its summation from the 
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curve (12) which can represent truly only a portion of the series; while the 
sum of y is unity in both cases. We conclude then that in all those portions 
of the infinite series to which (6) and (12) do not apply, the terms y are so 
small that the total of x*y, and consequently the total of y, is infinitesimal. 

This method of finding the expression (20) for the qudaratic error induc- 
tively from the finite series is one which I have not seen in any of the pub- 
lished works on probability. 

I take this opportunity to make a correction in my article on the Limit 
of Repeated Adjustments (Analyst, Sept. 1878), where the curve, whose 
ordinates y are the coefficients of the limiting resultant formula of adjust- 
ment, was shown to be a probability curve, or a higher curve of analogous 
nature, and the condition that the sum of all the consecutive ordinates must 
be unity was employed in determining the constant parameters of the curves. 

This condition was regarded as being expressed by 



/: 



ydx = 1, 

— —to 

whereas it should have been 

i r m 

rJ J dx = l > 

as we have seen in connection with formula (10) of the present paper. The 
consequence is, that the expressions for y obtained in formulas (10), (14) and 
(19) of the article referred to, are really expressions for Y—y-i-dx, and the 
numerical values given for y in Tables I. and III. are really values of Y. 
The correction is one that affects the theory of the matter rather than the 
practical results obtained. It has been a fault of long standing with wri- 
ters on the probability curve, to slur over the distinction between the infin- 
itesimal probability here denoted by y = Ydx, and the finite ordinate Y 
which is proportional to y. When y is regarded as an ordinate, it belongs 
to a curve whose total area is dx; but Y belongs to one whose area is unity. 

It may be well to add a remark upon the three formulas (6), (7) and (8) of 
the article referred to, where h represents the number of applications of the 
adjusting process, and is supposed to become infinite at the limit. The infin- 
ity in these three cases must be of the second, fourth and sixth orders respect- 
ively, as appears from considerations similar to those arising in formula (6) 
of the present paper, where m becomes an infinity of the second order. There 
cannot well be any limit to the increase of an abstract number. If the num- 
ber co is a supposable quantity, then oo 2 , <x> 4 and oo 6 are supposable also. 

For example, if dx is contained <?o times in the unit of x, and this unit 
of a; is contained cv> times in the whole axis of X, then dx is contained co 2 
times in the whole axis of X, and (dxf is contained oo 4 times in the infinite 
plane XY, and (dx) z is contained <x> 6 times in the infinite space XYZ. 
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The concrete result in the three cases of adjustment appears to be this, 
that for a finite number of applications, the limiting form is approached more 
rapidly in the first case than in the second, and more rapidly in the second 
than in the third. Thus under the first case, p. 133, the coefficients for only 
16 applications approach the limiting form more nearly than the coefficients 
for 32 applications do under the second case at p. 138, the limit being ap- 
parently farther off in the latter case. 

Note. — When a series is adjusted by two formulas successively, as at p. 
66 of Analyst for May 1878, where the two are denoted by 

«'o = *0 W 0+ *l(«l+«-l)+ h( U 2+ U -i)+"'+ 4.(«.»+W-m), 

< =L Q u' +L 1 (u\ +uL 1 )+L 2 (u' 2 +uL 2 )+ ...+£„ K "K. ), 

the coefficients in the resultant formula are the same as those which belong 
to the several powers of x in the product of the two polynomials 

L n +£„_ 1 x+L n _ i x 2 + . . . +L x n +L 1 x»+ * + . . . + L n x* \ 

This property, which I did not perceive at first, systematizes the matter 
and simplifies it to the mind, although it makes no difference in the actual 
amount of computation to be done, in forming the resultant coefficients. 

Since the resultant for k successive adjustments by one formula has the 
same coefficients as the corresponding polynomial raised to the k power, we 
are enabled to demonstrate the proposition (3) at p. 129 of the Analyst for 
Sept., 1878, by means of the Multinomial Theorem, as I will take occasion 
to show hereafter. The demonstration includes the general case where the 
adjustment formula is not symmetrical on each side of the middle, so that 
l r and L* are not necessarily equal. 



GENERAL RULES FOR THE FORMATION OF MAGIC 
SQUARES OF ALL ORDERS. 



BY PROP. A. B. NELSON, CENTRE COLLEGE, DANVILLE, KY. 

The following general rules for the formation of magic squares, whether 
of odd or even degree, may be new to the younger generation of mathe- 
maticians, and possibly to some others. The squares constructed by these 



